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Abstract 

We introduce a new statistical quantity the energy to test whether two samples 
originate from the same distributions. The energy is a simple logarithmic function 
of the distances of the observations in the variate space. The distribution of the test 
statistic is determined by a resampling method. The power of the energy test in one 
dimension was studied for a variety of different test samples and compared to several 
nonparametric tests. In two and four dimensions a comparison was performed with the 
Friedman-Rafsky and nearest neighbor tests. The two-sample energy test was shown 
to be especially powerful in multidimensional applications. 

1 INTRODUCTION 

Let Xi, X2, . . . , X„ and Yi, Y2, . . . , Y„i be two samples of independent random vectors 
with distributions F and G, respectively. The classical two-sample problem then consists 
of testing the hypothesis 

Hq : F{x) ^G{x.), for every X eM'', 
against the general alternative 

Hi : F{x.) ^ G'(x), for at least one x eM'', 

where the distribution functions F and G are unknown. 

Testing whether two samples are consistent with a single unknown distribution is a task 
that occurs in many areas of research. A natural and simple approach is to compare the 
first two moments of the sample which measure location and scale. Many tests of this type 
can be found in the literature Duran (1976)| Conover et al. (1981)| Buening (1991)| but 



distributions may differ in a more subtle way. Other tests require binning of data like 



the power-divergence statistic test Read and Cressie (1988) and tests of the x type- 



However, a high dimensional space is essentially empty, as is expressed in the literature 
by the term curse-of-dimensionality | Scott (1992)| , hence these tests are rather inefficient 
unless the sample sizes are large. Binning-free tests based on rank statistics are restricted 
to univariate distributions, and, when applied to the marginal distributions, they neglect 



correlations. The extension of the Wald-Wolfowitz run test I Wald and Wolfowitz (1940j 
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and the nearest neighbor test Henze (1988)| avoid these caveats but it is not obvious 



that they are sensitive to aU kind of differences in the parent distributions from which the 
samples are drawn. 

In this paper we propose a new test for the two-sample problem - the energy test - which 
shows high performance independent of the dimension of the variate space and which is easy 
to implement. Our test is related to Bowman- Foster test | Bowman and Foster (1993) but 
whereas this test is based on probability density estimation and local comparison, the energy 
test explores long range correlations. 

In Section 2 we define the test statistic Even though the energy test has been 

designed for multivariate applications, we apply it in Section 3 to univariate samples be- 
cause there a unbiased comparison to several well established univariate tests is easily 
possible. A selection of examples and tests investigated by Buening (1999)| are con sid- 



ered. These are the Kolmogorov-Smirnov, Cramer-von Mises, Wilcox | Wilcox (1997)| and 
Lepage Lepage (1971)| tests. We have added the test with equal probability bins. 

In Section 4 we study the power of the energy test in two and four dimensions and 
compare it to the Friedman- Raf sky | Friedman and Rafsky (1979)| and the nearest neighbor 
tests. 

We conclude in Section 5 with a short summary. 



2 THE TWO-SAMPLE ENERGY TEST 

The basic idea behind using the quantity energy to test the compatibility of two samples 
is simple. We consider the sample A : Xi,X2,...,X„ as a system of positive charges 
of charge 1/n each, and the second sample B : Yi , Y2 , . . . , Y„i as a system of negative 
charges of charge — 1/m. The charges are normalized such that each sample contains a 
total charge of one unit. From electrostatics we know that in the limit of where n, m tend 
to infinity, the total potential energy of the combined samples computed for a potential 
following a one-over-distance law will be minimum if both charge samples have the same 
distribution. The energy test generalizes these conditions. For the two-sample test we use 
a logarithmic potential in W^. In the Appendix we show that also in this case, large values 
of energy indicate significant deviations between the parent populations of the two samples. 



2.1 The test statistic 

The test statistic ^nm consists of three terms, which correspond to the energies of samples 
A {^a), B {^b) and the interaction energy ($ab) of the two samples 

1 " 

i<j 
^ m 

i<j 
^ n m 

*AB = ^^i?(|x,-y,|), 

nm ^ — ' ^ — ' 

where R{r) is a continuous, monotonic decreasing function of the Euclidean distance r 
between the charges. The choice of R may be adjusted to a specific statistical problem. 
For the present analysis, we select R{r) = — Inr instead of the electrostatic potential 1/r. 
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With this choice the test is scale invariant and offers a good rejection power against many 
alternatives to the null hypothesis. 

To compute the power of the new two-sample energy test we use the permutation method 



Efron and Tibshirani (1993) to evaluate the distribution of <i>„m under Hq. We merge 
the N = m + n observations of both samples and draw from the combined sample a 
subsample of size n without replacement. The remaining m observations represent a second 
sample. The probability distribution under Hq of is evaluated by determining the 
values of of all (^) = ^^^^ possible permutations. For large N this procedure can 
become computationally too laborious. Then the probability distribution is estimated from 
a random sample of all possible permutations. 

2.2 Normalization of the distance 

The Euclidean distances between two observations z, and z' in R"^ is 



|z,; - z' 1 = 



with projections and z^^,, k — 1, . . . , d of the vectors z^ and z^ . 

Since the relative scale of the different variates usually is arbitrary we propose to nor- 
malize the projections by the following transformation 

• _ Zjk — fJ-k i = l,...,n 

~ ak k = I, . . . ,d 

where /ifc, a^. are mean value and standard deviation of the projection zik, ■ ■ ■ , Znk of the 
coordinates of the observations of the pooled sample. In this way we avoid that a single 
projection dominates the value of the energy and that other projections contribute only 
marginally to it. 

We did not apply this transformation in the present study, because this might have 
biased the comparison with other tests and because the different variates had similar vari- 



3 POWER COMPARISONS 

The performance of various tests were assessed for finite sample sizes by Monte Carlo 
simulations in d = 1, 2 and 4 dimensions. Also the critical values of all considered tests 
were calculated by Monte Carlo simulation. We chose a 5% significance level. 

For the null hypothesis we determine the distribution of $nm with the permutation 
technique, as mentioned above. We followed | Efron and Tibshirani (1993) and generated 



1000 randomly selected two subsets in each case and determined the critical values (jic of 
(j)nm- For the specific case n = m = 50 and samples drawn from a uniform distribution we 
studied the statistical fluctuations. Transforming the confidence interval of (pc into limits 
for a, we obtain the interval [0.036,0.063], see TableQ] 

3.1 One dimensional case 

Even though the energy test has been designed for multivariate applications, we investi- 
gate its power in one dimension because there a comparison with several well established 
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Table 1: Confidence intervals as a function of the number of permutations for nominal 
a = 0.05. 



# of^ 

permutations 


CL(95%) for a 


100 


[0.006,0.095] 


300 


[0.025,0.075] 


500 


[0.031,0.068] 


1000 


[0.036,0.063] 



tests is possible . To avoid a personal bias we drew the two samples from the probability 
distributions, which have also been investigated by | Buening (2001) : 



1 -\/3 < a; < V3 
otherwise 



1 



1 ~\x\ 

1 1 



TT 1 + 



Cauchy 



x> -l 



with 3 degrees of freedom, 
transformed to mean 0, variance 1 



iiV(l.5,l) + iiV(-l.5,l) 

0.8A^(0,1 
1 



0.27V(0,42) 
2iV(l,22) + iiV(-l,l) 



This set /i to /g of probability distributions covers a variety of cases of short tailed up 
to very long tailed probability distributions as well as skewed ones. 

To evaluate the power of the tests we generated 1000 pairs of samples for small n = 
m = 25, moderate n — 50, m = 40 and "large" n = 100, m = 50, for seven different 
scenarios. We have transformed the variates Y* — 9 + rYj, j — 1, . . . ,to of the second 
sample, corresponding to the alternative distribution, with different location parameters 9 
and scale parameters t. Powers were simulated in all cases by counting the number of times 
a test resulted in a rejection divided by 1000. All tests have a nominal significance level of 
0.05. 

Table |21 shows the estimated power for small sample sizes, n — 25, m = 25, of the 
selected tests. These are the Kolmogorov-Smirnov (KS), Cramer- von Mises (CvM), Wilcox 
(W), Lepage (L). We have added the test with 5 equal probability bins. Tables 01 and 
01 present the results for n — 50, m = 40 and n = 100, ni = 50, respectively. For the large 
sample the number of bins was increased to 10. 

It is apparent that none of the considered tests performs better than all other tests for 
all alternatives. The results indicate that the power of the energy test in most of the cases 
is larger than that of the well known and KS tests and comparable to that of the CvM 
test. For long tailed distributions, e.g. for combinations {fs,f4), the energy test is the 
most powerful test. This is not unexpected since R{x) = — hi(x) is long range. Lepage and 
Wilcox tests are powerful tests for all combinations and sample sizes considered, however. 
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Table 2: Power of the selected tests for n=in=25, a = 0.05, x — » + tx 
Pi Pi B,t KS CvM W L ^25,25 F 



0.4 

0.5 


1.4 0.12 0.18 0.48 0.38 0.24 0.11 
1.6 0.37 0.41 0.87 0.69 0.54 0.17 
0.8 0.40 0.55 0.66 0.50 0.45 0.52 
0.5 0.70 0.70 0.93 0.86 0.85 0.85 


0.4 

J7(x) h[x) Q g 
0.5 


1.4 0.08 0.13 0.22 0.14 0.13 0.08 
1.6 0.20 0.29 0.46 0.34 0.31 0.14 
0.8 0.34 0.46 0.57 0.51 0.44 0.45 
0.5 0.72 0.69 0.93 0.93 0.89 0.88 


0.4 

h[x) Mx) Q g 
0.5 


1.4 0.17 0.23 0.22 0.19 0.19 0.14 
1.6 0.33 0.44 0.42 0.37 0.38 0.24 
0.8 0.64 0.70 0.67 0.66 0.67 0.60 
0.5 0.74 0.77 0.84 0.91 0.89 0.84 


0.4 

h[X) h{x) g g 

0.5 


1.4 0.06 0.09 0.19 0.13 0.12 0.07 
1.6 0.14 0.22 0.35 0.29 0.21 0.11 
0.8 0.46 0.59 0.65 0.59 0.45 0.54 
0.5 0.71 0.72 0.89 0.88 0.82 0.85 


0.4 

£ / \ £ f \ 0.6 
/6(a;) h{x) Qg 

0.5 


1.4 0.10 0.16 0.15 0.12 0.12 0.12 
1.6 0.16 0.25 0.24 0.22 0.16 0.20 
0.8 0.95 0.94 1.00 0.97 0.98 0.97 
0.5 0.99 0.98 1.00 1.00 1.00 1.00 


0.4 
6 

h{x) fsix) g'g 
0.5 


1.4 0.26 0.34 0.30 0.25 0.28 0.21 
1.6 0.55 0.64 0.56 0.53 0.50 0.43 
0.8 0.85 0.89 0.86 0.84 0.85 0.79 
0.5 0.90 0.93 0.94 0.97 0.95 0.91 


0.4 

fsix) f4{x) g'g 

0.5 


1.4 0.34 0.42 0.45 0.52 0.54 0.32 
1.6 0.60 0.67 0.68 0.77 0.80 0.51 

0.8 0.80 0.85 0.78 0.72 0.82 0.70 
0.5 0.81 0.84 0.81 0.71 0.84 0.72 



the Lepage test is based on the first two moments of the null distribution and therefore 
specifically adapted to the type of study presented here. 

3.2 Mult i- dimensional case 

For the general multivariate two-sample problem, only a few binning-free tests have been 
proposed. The Friedman-Rafsky test and the nearest neighbor test like the energy test are 
based on the distance between the observations. The Bowman- Foster goodness-of-fit test 
uses the probability density estimation (PDF) to deduce a p.d.f. from a sample. If this 
technique is applied to both samples, it obviously can be used as a two sample test. With a 
Gaussian Kernel it is almost identical to the energy test with a Gaussian distance function. 
We prefer the logarithmic function. 

3.2.1 Friedman-Rafsky test 

The Friedman-Rafsky test can be seen as a generalization of the univariate run test. The 

problem in generalizing the run test to more than one dimension is that there is no unique 
sorting scheme for the observations. The minimum spanning tree can be used for this 
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Table 3: Power of the selected tests for n=50, m=40, a = 0.05, x ^ 6 + tx 



Pi 


P2 




r 


KS 


CvM 


W 


L 


*I'50,40 


x' 


fii-r) 


.fr(.r) 


0.3 

0.4 


1.3 

().<S 


0.22 
0.49 


0.18 
0.47 


0.67 
U.()7 


0.41 
0.53 


0.25 
U.4() 


0.14 
().G2 




.f2{x) 


0.3 


1.3 


0.15 


0.17 


0.34 


0.16 


0.20 


0.10 


0.4 


0.8 


0.62 


0.56 


0.66 


0.70 


0.58 


0.58 


h{x) 


Mx) 


0.3 


1.3 


0.28 


0.29 


0.25 


0.22 


0.26 


0.14 


0.4; 


0.8 


0.67 


0.66 


0.65 


0.70 


0.68 


0.61 


h{x) 


f9ix) 


0.3 


1.3 


0.07 


0.07 


0.27 


0.12 


0.14 


0.07 


0.4 


0.8 


0.51 


0.50 


0.66 


0.58 


0.46 


0.61 


h{x) 


h{x) 


0.3 


1.3 


0.13 


0.14 


0.18 


0.12 


0.11 


0.14 


0.4; 


0.8 


1.00 


0.97 


1.00 


0.99 


1.00 


1.00 


h{x) 


h{x) 


0.3 


1.3 


0.38 


0.39 


0.32 


0.29 


0.31 


0.25 


0.4 


0.8 


0.84 


0.84 


0.80 


0.78 


0.86 


0.74 


h{x) 


fi{x) 


0.3; 


1.3 


0.50 


0.52 


0.59 


0.66 


0.64 


0.39 


0.4; 


0.8 


0.76 


0.77 


0.70 


0.63 


0.79 


0.59 



Tabic 4: Power of the selected tests for n=100. m=50, a = 0.05. x + tx 



Pi 


Pi 







KS 


CvU 




L 




\" 


Mx) 


Mx) 


0.3 
0.4 


1.3 
0.8 


0.32 
0.68 


0.33 
0.73 


0.97 
0.85 


0.62 
0.76 


0.44 
0.76 


0.28 
0.66 


frix) 


Mx) 


0.3 
0.4 


1.3 
0.8 


0.21 
0.82 


0.27 
0.79 


0.67 
0.82 


0.28 
0.90 


0.33 
0.82 


0.26 
0.64 


f2{x) 


Mx) 


0.3 
0.4 


1.3 
0.8 


0.31 
0.85 


0.37 
0.86 


0.41 
0.82 


0.29 
0.90 


0.34 
0.89 


0.17 
0.72 


f2{x) 


Mx) 


0.3 

0.4 


1.3 

0.8 


0.08 

0.65 


0.10 

0.66 


0.51 
0.79 


0.18 
0.77 


0.21 
0.67 


0.17 
0.63 


h{x) 


Mx) 


0.3 
0.4 


1.3 
0.8 


0.13 
1.00 


0.19 
1.00 


0.25 
1.00 


0.18 
1.00 


0.18 
1.00 


0.22 
1.00 


Mx) 


Mx) 


0.3 

0.4 


1.3 

0.8 


0.46 

0.95 


0.52 
0.97 


0.44 

0.92 


0.44 

0.94 


0.47 

1.00 


0.25 

0.86 


Mx) 


Mx) 


0.3 
0.4 


1.3 
0.8 


0.61 
0.90 


0.68 
0.93 


0.81 
0.87 


0.84 
0.86 


0.86 
0.92 


0.63 
0.74 
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Table 5: Two dimensional distributions used to generate the samples. 



C8-SG 


pX 


1 
1 


AT ( C\ T 




AT ( C\ T 


3 


7V(0,I 


4 


7V(0,I 





wi'n T 


6 


7V(0,I 


7 


[7(0,1 


8 


{7(0,1 


9 


{7(0,1 


10 


{7(0,1 


11 


{7(0,1 


12 


{/(0,1 


13 


{7(0,1 


14 


{/(0,1 



0, 



C(0,I) 

^log(0,I) 

1 

' 0.6 1 
1 0.9 
0.9 1 



Student's 



t2 



CJ(IO) 

CJ(5) 

CJ(2) 

CJ(1) 
CJ(0.8) 
CJ(0.6) 

80%{7(0, 1) + 20%iV (0.5,0.0521) 
50%{/(0, 1) + 50%N (0.5,0.221) 



purpose. It is a graph which connects all observations in such a way that the total Euclidean 
length of the connections is minimum. Closed cycles are inhibited. The minimum spanning 
tree of the pooled sample is formed. The test statistic Rnm equals the number of connections 
between observations from different samples. 

Obviously, in one dimension the test reduces to the run test. Small values of i?„mlead 
to a rejection of Hq. The statistic Rnm is asymptotically distribution- free under the null 
hypothesis Henze and Penrose (1998)] . 



3.2.2 The nearest neighbor test 

The nearest neighbor test statistic Nnm is the sum of the number of vectors of the pooled 
sample (Zi, . . . , Z„+m) where the nearest neighbor of Z^, denoted by N{Zi), is of the same 
type as Z^: 

n+m 



Nnm = I (Zi and N{Zi) belong to the same sample) 

1=1 

Here / is the indicator function. N{Zi) can be determined by a fixed but otherwise 
arbitrary norm on R'' . We selected the Euclidean norm. In | Henze (1988)| it is shown that 
the limiting distribution of Nnm is normal in the limit min(n, m) — > 00 and n/(n + m) —f r 
with < r < 1. Large values of iV,„„ lead to a rejection of Hq. 

3.2.3 Comparison of the tests 



In order to investigate how the performance of the tests using ^nm , Rnm and Nnm changes 
with the dimension, we have considered problems in dimensions d = 2 and 4. In Table 
|S1 and Table we summarize the alternative probability distributions P'^ and from 
which we drew the two samples. The first sample was drawn either from Af(0,I) or from 
{7(0, 1) where N(fi, V) is a multivariate normal probability distribution with the indicated 
mean vector /i and covariance matrix V and U{0, 1) is the multivariate uniform proba- 
bility distribution in the unit cube. The parent distributions of the second sample were 
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Table 6: Four dimensional distributions used to generate the samples. 

case W 

~ moj) WJ) 

2 iV(0,I) iViog(0,I) 

3 A^(0,I) 80%^(O,I)+20%A^(O,0.22l) 

/ 1 0.4 0.5 0.7 \ 



7V(0,I) 50%iV(O,I)+50%iV 



Student's 



5 


iV(0,I) 


6 


7V(0,I) 


7 


C/(0,1) 


8 


c/(o,i) 


9 


c/(o,i) 


10 


U{0,1) 


11 


U{0,1) 


12 


U{0,1) 


13 


C/(0,1) 


14 


C/(0,1) 



0.4 1 0.6 0.8 

' 0.5 0.6 1 0.9 

\ 0.7 0.8 0.9 1 J 

t2 



CJ(IO) 

CJ(5) 

CJ{2) 

CJ{1) 
CJ(0.8) 
CJ(0.6) 

80%C/(0, 1) + 20%Ar(0. 5,0.0521) 
50%;7(0, 1) + 50%iV(O.5,0.22l) 



the Cauchy distribution C, the A^iog distribution (explained below), correlated normal dis- 
tributions, the Student's distributions <2 and ^4 and Cook- Johnson CJ{a) distributions 
I Devroye (1 986 with correlation parameter a > 0. CJ{a) converges for a ^ 00 to the 
independent multivariate uniform distribution and a — s- corresponds to the totally corre- 
lated case Xii = Xi2 = ... = Xid, i = 1, ...,n. We generated the random vectors from CJ{a) 
via the standard gamma distribution with shape parameter a, following the prescription 
proposed by | Ahrens and Dieter (1977) . The distribution denoted by iViog is obtained by 
the variable transformation x \n\x\ applied to each coordinate of a multidimensional 
normal distribution and is not to be mixed up with the log-normal distribution. It is ex- 
tremely asymmetric. Some of the considered probability densities have also been used in a 
power study in | Bahr (1996) 



The various combinations emphasize different types of deviations between the popula- 
tions. These include location and scale shifts, differences in skewness and kurtosis as well 
as differences in the correlation of the variates. 

The test statistics ^nm, Rnm and Nnm were evaluated. 

The power was again computed for 5% significance level and samples of equal size n = 
m = 30, 50, and 100 (small, moderate and large) in two and four dimensions. Tableland 
Table |S1 illustrate the power of the three considered tests calculated from 1000 replications. 

The Friedman-Rafsky and the nearest neighbor tests show very similar rejection power. 
For all three sample sizes and dimensions the energy test performed better than the other 
two tests in almost all considered alternatives. This is astonishing because the logarithmic 
distance function is long range and the probability distributions in the cases 11 and 12 have 
a sharp peak in one corner of a c? dimensional unit cube and in case 13 a sharp peak in 
the middle of this unit cube. The multivariate student distribution represents very mild 
departures from normality, but nevertheless the rejection rate of the energy test is high. 
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Table 7: Power at significance level a =0.05, calculated from 1000 repetitions, n = m = 30, 

n = m = 50 and n = m = 100, d = 2 



case 


■^30,30 


■^30,30 


^30,30 


-^50,50 


-^50,50 


^50,50 


1 


0.25 


0.23 


0.57 


0.44 


0.41 


0.86 


2 


0.33 


0.30 


0.58 


0.53 


0.50 


0.89 


3 


0.14 


0.12 


0.13 


0.17 


0.20 


0.21 


4 


0.63 


0.57 


0.53 


0.87 


0.83 


0.87 


5 


0.14 


0.14 


0.32 


0.18 


0.20 


0.49 


6 


0.07 


0.07 


0.11 


0.08 


0.08 


0.13 


7 


0.04 


0.07 


0.05 


0.04 


0.07 


0.05 


8 


0.05 


0.06 


0.08 


0.05 


0.08 


0.06 


9 


0.08 


0.08 


0.10 


0.08 


0.10 


0.14 


10 


0.13 


0.12 


0.15 


0.18 


0.18 


0.23 


11 


0.15 


0.14 


0.18 


0.23 


0.22 


0.30 


12 


0.20 


0.20 


0.25 


0.33 


0.31 


0.45 


13 


0.11 


0.10 


0.14 


0.16 


0.15 


0.33 


14 


0.09 


0.09 


0.14 


0.12 


0.11 


0.22 



case i?ioo,ioo -/Viooaoo $100,100 



1 


0.70 


0.60 


1.00 


2 


0.82 


0.74 


1.00 


3 


0.31 


0.28 


0.47 


4 


0.99 


0.97 


1.00 


5 


0.34 


0.29 


0.86 


6 


0.10 


0.11 


0.24 


7 


0.04 


0.05 


0.10 


8 


0.05 


0.05 


0.09 


9 


0.10 


0.11 


0.24 


10 


0.25 


0.23 


0.52 


11 


0.32 


0.29 


0.66 


12 


0.56 


0.48 


0.90 


13 


0.23 


0.19 


0.78 


14 


0.16 


0.16 


0.56 
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Table 8: Power at significance level a =0.05, calculated from 1000 repetitions, n = m = 30, 

n = m = 50 and n = m = 100, d = 4 



case 


■^30,30 


■^30,30 


^30,30 


-^50,50 


-^50,50 


^50,50 


1 


0.15 


0.19 


0.68 


0.22 


0.39 


0.93 


2 


0.46 


0.51 


0.90 


0.68 


0.78 


1.00 


3 


0.12 


0.13 


0.23 


0.14 


0.17 


0.47 


4 


0.08 


0.17 


0.13 


0.09 


0.26 


0.18 


5 


0.16 


0.21 


0.73 


0.22 


0.35 


0.95 


6 


0.06 


0.07 


0.17 


0.08 


0.10 


0.31 


7 


0.05 


0.06 


0.06 


0.06 


0.06 


0.07 


8 


0.06 


0.07 


0.08 


0.06 


0.09 


0.07 


9 


0.10 


0.12 


0.18 


0.13 


0.20 


0.30 


10 


0.16 


0.26 


0.30 


0.27 


0.42 


0.62 


11 


0.23 


0.37 


0.45 


0.39 


0.58 


0.77 


12 


0.35 


0.51 


0.65 


0.58 


0.76 


0.93 


13 


0.15 


0.16 


0.27 


0.20 


0.20 


0.62 


14 


0.11 


0.13 


0.17 


0.14 


0.18 


0.31 



case -Rioo,ioo -/Vioo,ioo $100,100 



1 


0.47 


0.73 


1.00 


2 


0.93 


0.98 


1.00 


3 


0.28 


0.25 


0.94 


4 


0.15 


0.46 


0.49 


5 


0.45 


0.63 


1.00 


6 


0.12 


0.16 


0.62 


7 


0.06 


0.08 


0.10 


8 


0.05 


0.09 


0.12 


9 


0.19 


0.29 


0.60 


10 


0.49 


0.66 


0.97 


11 


0.69 


0.84 


0.99 


12 


0.88 


0.96 


1.00 


13 


0.31 


0.29 


0.99 


14 


0.22 


0.23 


0.68 
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4 SUMMARY 



We have introduced the statistic energy which is a simple function of the distances of 
observations in the sample space. It can be used as a powerful measure of the compatibility 
of two samples. It is easy to compute, efficient and applicable in arbitrary dimensions of 
the sample space. The comparison to the Friedman-Rafsky and the nearest neighbor tests 
demonstrates its high performance in the multi-dimensional case. 



5 APPENDIX 

We define the energy of the difference of two probability density functions by 

j [/W - /oW] [/(^') - /o(x')] i?(x,x')dxdx'. 

Here and in what follows, an unspecified integral denotes integration over R''. Substi- 
tuting g(x) =/(x) — /o(x) we obtain 

</'-^yy5(x)5(x')i?(x,x')dx(ix'. (1) 

We replace in the Eq.QJ the distance function i?(x,x') = i?(|x — x'|) by its Fourier 
integral 

i?{|x - x'l) = j F(k)e*'^-("-"')(ik 



and obtain 



0-^1 j j 5(x)g(x')F(k)e''' ('^-')dxdx'dk 
= \j |G(k)|^F(k)dk. (2) 
where G'(k) is the Fourier transform of (/(x). 

For the function i?(|r|) = R{r) = with d > k, where d is the dimension of r, the 
Fourier transformation F(k) is Gel'fand and Shelov (1964)| : 

F(k) = 2'^-«7r'*/2_L22^K-d 

r(f) 

with k = |k|. 

From Eq.Q follows that (f) is positive. The minimum (pmin = is obtained for f;min(x) = 
or /(x) = /o(x). The result (7min(x) = holds also for the logarithmic distance function 
i?(r) = — Inr which can be considered as the k — limit of the power law distance function: 

— In 7' = lim 71 — ^ 1 ) 
n-^oo \\rj I 

The additional constant in the distance function does not contribute to the integral (Q). 
Expanding Q we get a sum of three expectation values of i?(x,x') 

'^^IJ 1 1^^'')^^''') - 2/o(x)/(x') + /o(x)/o(x')] i?(x,x')dxdx' 



11 



which can be estimated from two samples drawn from / and /o, respectively, as defined in 



The quantity in the brackets is up to a minor difference in the denominators equal to 
our test quantity ^nm- In the limit n, m ^ oo the statistic cpnm is minimum if the two 
samples are from the same distribution. 
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